Serveur d'exploration Cyberinfrastructure

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Sifting through genomes with iterative-sequence clustering produces a large, phylogenetically diverse protein-family resource

Identifieur interne : 000468 ( Main/Exploration ); précédent : 000467; suivant : 000469

Sifting through genomes with iterative-sequence clustering produces a large, phylogenetically diverse protein-family resource

Auteurs : Thomas J. Sharpton [États-Unis] ; Guillaume Jospin [États-Unis] ; Dongying Wu [États-Unis] ; Morgan Gi Langille [Canada] ; Katherine S. Pollard [États-Unis] ; Jonathan A. Eisen [États-Unis]

Source :

RBID : PMC:3481395

Abstract

Background

New computational resources are needed to manage the increasing volume of biological data from genome sequencing projects. One fundamental challenge is the ability to maintain a complete and current catalog of protein diversity. We developed a new approach for the identification of protein families that focuses on the rapid discovery of homologous protein sequences.

Results

We implemented fully automated and high-throughput procedures to de novo cluster proteins into families based upon global alignment similarity. Our approach employs an iterative clustering strategy in which homologs of known families are sifted out of the search for new families. The resulting reduction in computational complexity enables us to rapidly identify novel protein families found in new genomes and to perform efficient, automated updates that keep pace with genome sequencing. We refer to protein families identified through this approach as “Sifting Families,” or SFams. Our analysis of ~10.5 million protein sequences from 2,928 genomes identified 436,360 SFams, many of which are not represented in other protein family databases. We validated the quality of SFam clustering through statistical as well as network topology–based analyses.

Conclusions

We describe the rapid identification of SFams and demonstrate how they can be used to annotate genomes and metagenomes. The SFam database catalogs protein-family quality metrics, multiple sequence alignments, hidden Markov models, and phylogenetic trees. Our source code and database are publicly available and will be subject to frequent updates (http://edhar.genomecenter.ucdavis.edu/sifting_families/).


Url:
DOI: 10.1186/1471-2105-13-264
PubMed: 23061897
PubMed Central: 3481395


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Sifting through genomes with iterative-sequence clustering produces a large, phylogenetically diverse protein-family resource</title>
<author>
<name sortKey="Sharpton, Thomas J" sort="Sharpton, Thomas J" uniqKey="Sharpton T" first="Thomas J" last="Sharpton">Thomas J. Sharpton</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">The J. David Gladstone Institutes, University of California San Francisco, San Francisco, CA, 94158, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>The J. David Gladstone Institutes, University of California San Francisco, San Francisco, CA, 94158</wicri:regionArea>
<wicri:noRegion>94158</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Jospin, Guillaume" sort="Jospin, Guillaume" uniqKey="Jospin G" first="Guillaume" last="Jospin">Guillaume Jospin</name>
<affiliation wicri:level="1">
<nlm:aff id="I2">UC Davis Genome Center, University of California, Davis, Davis, CA, 95616, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>UC Davis Genome Center, University of California, Davis, Davis, CA, 95616</wicri:regionArea>
<wicri:noRegion>95616</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Wu, Dongying" sort="Wu, Dongying" uniqKey="Wu D" first="Dongying" last="Wu">Dongying Wu</name>
<affiliation wicri:level="1">
<nlm:aff id="I2">UC Davis Genome Center, University of California, Davis, Davis, CA, 95616, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>UC Davis Genome Center, University of California, Davis, Davis, CA, 95616</wicri:regionArea>
<wicri:noRegion>95616</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="I7">Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598</wicri:regionArea>
<wicri:noRegion>94598</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Langille, Morgan Gi" sort="Langille, Morgan Gi" uniqKey="Langille M" first="Morgan Gi" last="Langille">Morgan Gi Langille</name>
<affiliation wicri:level="1">
<nlm:aff id="I3">Department of Biochemistry & Molecular Biology, Dalhousie University, Halifax, Nova Scotia, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Department of Biochemistry & Molecular Biology, Dalhousie University, Halifax, Nova Scotia</wicri:regionArea>
<wicri:noRegion>Nova Scotia</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Pollard, Katherine S" sort="Pollard, Katherine S" uniqKey="Pollard K" first="Katherine S" last="Pollard">Katherine S. Pollard</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">The J. David Gladstone Institutes, University of California San Francisco, San Francisco, CA, 94158, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>The J. David Gladstone Institutes, University of California San Francisco, San Francisco, CA, 94158</wicri:regionArea>
<wicri:noRegion>94158</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="I4">Department of Epidemiology & Biostatistics, Institute for Human Genetics, University of California San Francisco, San Francisco, CA, 94158, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Epidemiology & Biostatistics, Institute for Human Genetics, University of California San Francisco, San Francisco, CA, 94158</wicri:regionArea>
<wicri:noRegion>94158</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Eisen, Jonathan A" sort="Eisen, Jonathan A" uniqKey="Eisen J" first="Jonathan A" last="Eisen">Jonathan A. Eisen</name>
<affiliation wicri:level="1">
<nlm:aff id="I2">UC Davis Genome Center, University of California, Davis, Davis, CA, 95616, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>UC Davis Genome Center, University of California, Davis, Davis, CA, 95616</wicri:regionArea>
<wicri:noRegion>95616</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="I5">Deptartment of Evolution and Ecology, University of California, Davis, Davis, CA, 95616, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Deptartment of Evolution and Ecology, University of California, Davis, Davis, CA, 95616</wicri:regionArea>
<wicri:noRegion>95616</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="I6">Deptartment of Medical Microbiology and Immunology, University of California, Davis, Davis, CA, 95616, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Deptartment of Medical Microbiology and Immunology, University of California, Davis, Davis, CA, 95616</wicri:regionArea>
<wicri:noRegion>95616</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="I7">Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598</wicri:regionArea>
<wicri:noRegion>94598</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">23061897</idno>
<idno type="pmc">3481395</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3481395</idno>
<idno type="RBID">PMC:3481395</idno>
<idno type="doi">10.1186/1471-2105-13-264</idno>
<date when="2012">2012</date>
<idno type="wicri:Area/Pmc/Corpus">000326</idno>
<idno type="wicri:Area/Pmc/Curation">000326</idno>
<idno type="wicri:Area/Pmc/Checkpoint">000379</idno>
<idno type="wicri:Area/Ncbi/Merge">000362</idno>
<idno type="wicri:Area/Ncbi/Curation">000362</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">000362</idno>
<idno type="wicri:Area/Main/Merge">000469</idno>
<idno type="wicri:Area/Main/Curation">000468</idno>
<idno type="wicri:Area/Main/Exploration">000468</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">Sifting through genomes with iterative-sequence clustering produces a large, phylogenetically diverse protein-family resource</title>
<author>
<name sortKey="Sharpton, Thomas J" sort="Sharpton, Thomas J" uniqKey="Sharpton T" first="Thomas J" last="Sharpton">Thomas J. Sharpton</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">The J. David Gladstone Institutes, University of California San Francisco, San Francisco, CA, 94158, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>The J. David Gladstone Institutes, University of California San Francisco, San Francisco, CA, 94158</wicri:regionArea>
<wicri:noRegion>94158</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Jospin, Guillaume" sort="Jospin, Guillaume" uniqKey="Jospin G" first="Guillaume" last="Jospin">Guillaume Jospin</name>
<affiliation wicri:level="1">
<nlm:aff id="I2">UC Davis Genome Center, University of California, Davis, Davis, CA, 95616, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>UC Davis Genome Center, University of California, Davis, Davis, CA, 95616</wicri:regionArea>
<wicri:noRegion>95616</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Wu, Dongying" sort="Wu, Dongying" uniqKey="Wu D" first="Dongying" last="Wu">Dongying Wu</name>
<affiliation wicri:level="1">
<nlm:aff id="I2">UC Davis Genome Center, University of California, Davis, Davis, CA, 95616, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>UC Davis Genome Center, University of California, Davis, Davis, CA, 95616</wicri:regionArea>
<wicri:noRegion>95616</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="I7">Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598</wicri:regionArea>
<wicri:noRegion>94598</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Langille, Morgan Gi" sort="Langille, Morgan Gi" uniqKey="Langille M" first="Morgan Gi" last="Langille">Morgan Gi Langille</name>
<affiliation wicri:level="1">
<nlm:aff id="I3">Department of Biochemistry & Molecular Biology, Dalhousie University, Halifax, Nova Scotia, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Department of Biochemistry & Molecular Biology, Dalhousie University, Halifax, Nova Scotia</wicri:regionArea>
<wicri:noRegion>Nova Scotia</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Pollard, Katherine S" sort="Pollard, Katherine S" uniqKey="Pollard K" first="Katherine S" last="Pollard">Katherine S. Pollard</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">The J. David Gladstone Institutes, University of California San Francisco, San Francisco, CA, 94158, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>The J. David Gladstone Institutes, University of California San Francisco, San Francisco, CA, 94158</wicri:regionArea>
<wicri:noRegion>94158</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="I4">Department of Epidemiology & Biostatistics, Institute for Human Genetics, University of California San Francisco, San Francisco, CA, 94158, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Epidemiology & Biostatistics, Institute for Human Genetics, University of California San Francisco, San Francisco, CA, 94158</wicri:regionArea>
<wicri:noRegion>94158</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Eisen, Jonathan A" sort="Eisen, Jonathan A" uniqKey="Eisen J" first="Jonathan A" last="Eisen">Jonathan A. Eisen</name>
<affiliation wicri:level="1">
<nlm:aff id="I2">UC Davis Genome Center, University of California, Davis, Davis, CA, 95616, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>UC Davis Genome Center, University of California, Davis, Davis, CA, 95616</wicri:regionArea>
<wicri:noRegion>95616</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="I5">Deptartment of Evolution and Ecology, University of California, Davis, Davis, CA, 95616, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Deptartment of Evolution and Ecology, University of California, Davis, Davis, CA, 95616</wicri:regionArea>
<wicri:noRegion>95616</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="I6">Deptartment of Medical Microbiology and Immunology, University of California, Davis, Davis, CA, 95616, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Deptartment of Medical Microbiology and Immunology, University of California, Davis, Davis, CA, 95616</wicri:regionArea>
<wicri:noRegion>95616</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="I7">Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598</wicri:regionArea>
<wicri:noRegion>94598</wicri:noRegion>
</affiliation>
</author>
</analytic>
<series>
<title level="j">BMC Bioinformatics</title>
<idno type="eISSN">1471-2105</idno>
<imprint>
<date when="2012">2012</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<sec>
<title>Background</title>
<p>New computational resources are needed to manage the increasing volume of biological data from genome sequencing projects. One fundamental challenge is the ability to maintain a complete and current catalog of protein diversity. We developed a new approach for the identification of protein families that focuses on the rapid discovery of homologous protein sequences.</p>
</sec>
<sec>
<title>Results</title>
<p>We implemented fully automated and high-throughput procedures to
<italic>de novo</italic>
cluster proteins into families based upon global alignment similarity. Our approach employs an iterative clustering strategy in which homologs of known families are sifted out of the search for new families. The resulting reduction in computational complexity enables us to rapidly identify novel protein families found in new genomes and to perform efficient, automated updates that keep pace with genome sequencing. We refer to protein families identified through this approach as “Sifting Families,” or SFams. Our analysis of ~10.5 million protein sequences from 2,928 genomes identified 436,360 SFams, many of which are not represented in other protein family databases. We validated the quality of SFam clustering through statistical as well as network topology–based analyses.</p>
</sec>
<sec>
<title>Conclusions</title>
<p>We describe the rapid identification of SFams and demonstrate how they can be used to annotate genomes and metagenomes. The SFam database catalogs protein-family quality metrics, multiple sequence alignments, hidden Markov models, and phylogenetic trees. Our source code and database are publicly available and will be subject to frequent updates (
<ext-link ext-link-type="uri" xlink:href="http://edhar.genomecenter.ucdavis.edu/sifting_families/">http://edhar.genomecenter.ucdavis.edu/sifting_families/</ext-link>
).</p>
</sec>
</div>
</front>
<back>
<div1 type="bibliography">
<listBibl>
<biblStruct>
<analytic>
<author>
<name sortKey="Koonin, Ev" uniqKey="Koonin E">EV Koonin</name>
</author>
<author>
<name sortKey="Wolf, Yi" uniqKey="Wolf Y">YI Wolf</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Perna, Nt" uniqKey="Perna N">NT Perna</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Tettelin, H" uniqKey="Tettelin H">H Tettelin</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Rasko, Da" uniqKey="Rasko D">DA Rasko</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Wu, D" uniqKey="Wu D">D Wu</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Yooseph, S" uniqKey="Yooseph S">S Yooseph</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Tatusov, Rl" uniqKey="Tatusov R">RL Tatusov</name>
</author>
<author>
<name sortKey="Koonin, Ev" uniqKey="Koonin E">EV Koonin</name>
</author>
<author>
<name sortKey="Lipman, Dj" uniqKey="Lipman D">DJ Lipman</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Consortium, Tu" uniqKey="Consortium T">TU Consortium</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kanehisa, M" uniqKey="Kanehisa M">M Kanehisa</name>
</author>
<author>
<name sortKey="Goto, S" uniqKey="Goto S">S Goto</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Lima, T" uniqKey="Lima T">T Lima</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Meyer, F" uniqKey="Meyer F">F Meyer</name>
</author>
<author>
<name sortKey="Overbeek, R" uniqKey="Overbeek R">R Overbeek</name>
</author>
<author>
<name sortKey="Rodriguez, A" uniqKey="Rodriguez A">A Rodriguez</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Punta, M" uniqKey="Punta M">M Punta</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Haft, Dh" uniqKey="Haft D">DH Haft</name>
</author>
<author>
<name sortKey="Selengut, Jd" uniqKey="Selengut J">JD Selengut</name>
</author>
<author>
<name sortKey="White, O" uniqKey="White O">O White</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Thomas, Pd" uniqKey="Thomas P">PD Thomas</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Krishnamurthy, N" uniqKey="Krishnamurthy N">N Krishnamurthy</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Klimke, W" uniqKey="Klimke W">W Klimke</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Powell, S" uniqKey="Powell S">S Powell</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Friedberg, I" uniqKey="Friedberg I">I Friedberg</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Sharpton, Tj" uniqKey="Sharpton T">TJ Sharpton</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Inskeep, Wp" uniqKey="Inskeep W">WP Inskeep</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Enright, Aj" uniqKey="Enright A">AJ Enright</name>
</author>
<author>
<name sortKey="Van Dongen, S" uniqKey="Van Dongen S">S Van Dongen</name>
</author>
<author>
<name sortKey="Ouzounis, Ca" uniqKey="Ouzounis C">CA Ouzounis</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Liu, K" uniqKey="Liu K">K Liu</name>
</author>
<author>
<name sortKey="Linder, Cr" uniqKey="Linder C">CR Linder</name>
</author>
<author>
<name sortKey="Warnow, T" uniqKey="Warnow T">T Warnow</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Mcdowall, J" uniqKey="Mcdowall J">J McDowall</name>
</author>
<author>
<name sortKey="Hunter, S" uniqKey="Hunter S">S Hunter</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Brenner, Se" uniqKey="Brenner S">SE Brenner</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Sun, S" uniqKey="Sun S">S Sun</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Altschul, Sf" uniqKey="Altschul S">SF Altschul</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Edgar, Rc" uniqKey="Edgar R">RC Edgar</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Guindon, S" uniqKey="Guindon S">S Guindon</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Eddy, Sr" uniqKey="Eddy S">SR Eddy</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Price, Mn" uniqKey="Price M">MN Price</name>
</author>
<author>
<name sortKey="Dehal, Ps" uniqKey="Dehal P">PS Dehal</name>
</author>
<author>
<name sortKey="Arkin, Ap" uniqKey="Arkin A">AP Arkin</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Markowitz, Vm" uniqKey="Markowitz V">VM Markowitz</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Csardi, G" uniqKey="Csardi G">G Csardi</name>
</author>
<author>
<name sortKey="Nepusz, T" uniqKey="Nepusz T">T Nepusz</name>
</author>
</analytic>
</biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<affiliations>
<list>
<country>
<li>Canada</li>
<li>États-Unis</li>
</country>
</list>
<tree>
<country name="États-Unis">
<noRegion>
<name sortKey="Sharpton, Thomas J" sort="Sharpton, Thomas J" uniqKey="Sharpton T" first="Thomas J" last="Sharpton">Thomas J. Sharpton</name>
</noRegion>
<name sortKey="Eisen, Jonathan A" sort="Eisen, Jonathan A" uniqKey="Eisen J" first="Jonathan A" last="Eisen">Jonathan A. Eisen</name>
<name sortKey="Eisen, Jonathan A" sort="Eisen, Jonathan A" uniqKey="Eisen J" first="Jonathan A" last="Eisen">Jonathan A. Eisen</name>
<name sortKey="Eisen, Jonathan A" sort="Eisen, Jonathan A" uniqKey="Eisen J" first="Jonathan A" last="Eisen">Jonathan A. Eisen</name>
<name sortKey="Eisen, Jonathan A" sort="Eisen, Jonathan A" uniqKey="Eisen J" first="Jonathan A" last="Eisen">Jonathan A. Eisen</name>
<name sortKey="Jospin, Guillaume" sort="Jospin, Guillaume" uniqKey="Jospin G" first="Guillaume" last="Jospin">Guillaume Jospin</name>
<name sortKey="Pollard, Katherine S" sort="Pollard, Katherine S" uniqKey="Pollard K" first="Katherine S" last="Pollard">Katherine S. Pollard</name>
<name sortKey="Pollard, Katherine S" sort="Pollard, Katherine S" uniqKey="Pollard K" first="Katherine S" last="Pollard">Katherine S. Pollard</name>
<name sortKey="Wu, Dongying" sort="Wu, Dongying" uniqKey="Wu D" first="Dongying" last="Wu">Dongying Wu</name>
<name sortKey="Wu, Dongying" sort="Wu, Dongying" uniqKey="Wu D" first="Dongying" last="Wu">Dongying Wu</name>
</country>
<country name="Canada">
<noRegion>
<name sortKey="Langille, Morgan Gi" sort="Langille, Morgan Gi" uniqKey="Langille M" first="Morgan Gi" last="Langille">Morgan Gi Langille</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/CyberinfraV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000468 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000468 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    CyberinfraV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     PMC:3481395
   |texte=   Sifting through genomes with iterative-sequence clustering produces a large, phylogenetically diverse protein-family resource
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Exploration/RBID.i   -Sk "pubmed:23061897" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd   \
       | NlmPubMed2Wicri -a CyberinfraV1 

Wicri

This area was generated with Dilib version V0.6.25.
Data generation: Thu Oct 27 09:30:58 2016. Site generation: Sun Mar 10 23:08:40 2024